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Abstract. The fermion bag approach is a new method to tackle fermion sign problems in lattice field 
theories. Using this approach it is possible to solve a class of sign problems that seem unsolvable by 
traditional methods. The new solutions emerge when partition functions are written in terms of fermion 
bags and bosonic worldlines. In these new variables it is possible to identify hidden pairing mechanisms 
which lead to the solutions. The new solutions allow us for the first time to use Monte Carlo methods 
to solve a variety of interesting lattice field theories, thus creating new opportunities for understanding 
strongly correlated fermion systems. 

PACS. 71.10.Fd Lattice fermion models - 02.70.Ss Quantum Monte Carlo methods 



1 Introduction 

Computing properties of strongly correlated quantum sys- 
tems, especially with fermions, continues to be an out- 
standing challenge in quantum many body theory. While 
Monte Carlo methods can help in such computations, it 
can be notoriously difficult to design them due to complex 
phases that arise from quantum interference [T]. These 
complex phases and associated negative signs are essential 
to produce the novel phenomena found in such systems [1] . 
The negative signs make it difficult to identify the correct 
probability distribution to sample from. For example, if 
one is interested in computing the thermal properties of 
a physical system with Hamiltonian H at temperature T, 
the first step in constructing a Monte Carlo algorithm in- 
volves rewriting the canonical quantum partition function 
Z as a classical partition function. This means one writes 



Z = Tr(e-^/^) = 5: 12(C), 



(1) 



where C is an element from a set of classical configura- 
tions and il{C) is its Boltzmann weight. The Monte Carlo 
algorithm is then designed to sample C with probabil- 
ity proportional to Q{C). Using Feynman path integrals 
it is always possible to find classical representations ([1]) 
for quantum problems, however in many interesting cases 
f2{C) is either not positive or is not computable in poly- 
nomial time. In both cases it becomes difficult to design 
efficient Monte Carlo algorithms and we say that the clas- 
sical representation suffers from a sign problem. Solution 
to this problem involves finding a classical representation 
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with positive Boltzmann weights computable in polyno- 
mial time. Finding classical representations of strongly 
correlated quantum systems free of sign problems is an 
important field of research at the crossroads of computa- 
tional and mathematical physics. 

Classical representations of strongly correlated fermion 
systems are particularly challenging. Due to the Pauli 
principle, when two identical fermions permute the quan- 
tum amplitude picks a negative sign. The spinor compo- 
nents of a fermion can also mix and create further complex 
Berry phases. The traditional approach for constructing a 
classical representation exploits the fact that free fermion 
path integrals can be computed analytically and expressed 
as the determinant of a fermion matrix. Since fermions can 
always be envisioned as moving freely in background po- 
tentials, in the traditional approach one begins with this 
view point and integrates over all fermion degrees of free- 
dom uniformly. Thus, the quantum partition function is 
written as a sum over determinants of matrices that de- 
pend on background potentials [2111]. Since determinants 
are computable in polynomial time, if they are positive 
the fermion sign problem is absent and one can design 
Monte Carlo algorithms ^. However, there are many ex- 
amples where the determinants can be negative and the 
sign problem remains unsolved. 

The fermion bag approach is an alternative method to 
deal with fermion path integrals in lattice field theories [B] . 
Instead of integrating over all fermion degrees of freedom 
uniformly, the idea is to group them carefully and try to 
integrate over each group separately. Each group is viewed 
as a fermion bag and the fermion path integral within each 
bag is computed analytically. The result of this integra- 
tion produces the weight of the bag and as long as it is 
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positive, sign problems are absent. The fermion bag ap- 
proach is an extension of the meron-cluster approach [7j , 
but is applicable more widely. 

Fermion bags are not unique and should be defined 
wisely. Strong and weak coupling expansions can help guide 
their definition. For example at extremely strong couplings 
fermions are paired into bosons and fermion sign prob- 
lems are naturally solved in a proper set of variables. 
As the coupling becomes weak, free fermions emerge in 
small regions which can play the role of fermion bags. 
Similarly when fermions are weakly coupled, Feynman di- 
agrams suggest that fermions be viewed as propagating 
only between interaction sites which means all interaction 
sites may be defined as a fermion bag. This diagrammatic 
approach is well known in condensed matter physics and 
has become popular recently [8,9 . There are interesting 
dualities between strong and weak coupling definitions of 
fermion bags [TU]. 

The fermion bag approach has a wide variety of appli- 
cations. In particular when bosonic background fields are 
written in terms of worldline variables [llj . a new class 
of sign problems become solvable [T^[T5] . Using a vari- 
ety of examples, we will discuss some of the opportunities 
that the new solutions create for understanding strongly 
correlated fermion systems. 



2 The Fermion Bag Idea 

The Euclidean quantum partition function for a variety of 
systems containing fermions can be written schematically 
as 

Z = [[da] e"^''('") f[(hpd4>] e'^ '^'^"^ ^ (2) 

where Sb{o') is a real bosonic action, t/;,-;/' are Grassmann 
fields representing fermions and M{a) is the fermion ma- 
trix that depends on the bosonic fields a. In order to con- 
struct a classical representation useful for Monte Carlo 
methods, the traditional approach is to integrate out Grass- 
mann variables and rewrite the partition function as 



[[da] e-^"^'''^ Bet{M{a)). 



(3) 



If the determinant of the fermion matrix is non-negative 
the classical representation ([3]) is free of sign problems. 

In many problems where Det(M(CT)) is non-negative 
the fermion fields ip and V' can be rotated to a basis where 
the matrix Af (cr) takes the form 



M(cr) 



D{a) 
-D^a) 



(4) 



which we refer to as the solvable form. In such cases the 
partition function can be written as 



Z= [[da] e-'^-'M |Det(D((T))|2, 



(5) 



which explicitly shows that the representation of the par- 
tition function is free of sign problems. Whenever the 



O 



I 



Fig. 1. Grassmann integrals can be performed by grouping 
variables. The above figure illustrates four groups of Grass- 
mann variables discussed in the text. Each group is referred to 
as a fermion bag. 



fermion matrix has a solvable form ^ we say that an iden- 
tifiable pairing mechanism (in a computational sense) ex- 
ists in the underlying physics. There are interesting prob- 
lems where such a pairing mechanism is hidden and extra 
work is necessary to uncover it. For example, in the pres- 
ence of positive diagonal elements, Det (M(cr)) is no longer 
the modulus square of a single determinant, instead it can 
be written as a weighted sum of modulus squares of deter- 
minants of smaller matrices. Unfortunately, there remain 
many interesting problems where there is no identifiable 
pairing mechanism and the fermion sign problem seems 
unsolvable in the traditional approach. As we will see be- 
low, the fermion bag approach shows that sometimes the 
pairing mechanism is hidden in background fields and can 
be recovered by changing the representation of the back- 
ground fields. 

The basic idea behind the fermion bag approach is to 
group Grassmann variables creatively and perform the in- 
tegration over each group separately. This is in contrast 
with the traditional approach in which all Grassmann vari- 
ables are treated uniformly and integrated over. Consider 
for example the Grassmann integral given by 



[dipd-ip] e 



'ip M 4) 



i-Ul ^l^^^1pxr)i-U2 ■4)x2'^x^) 



i-U3 Tp^^^x^lp^.^xs) 



(6) 



where M is assumed to be in the solvable form (HI . In this 
example there are four groups of sites that can be treated 
independently. These are (a;i), (X2), {x^^x^) and all the 
remaining sites. These four groups are illustrated pictori- 
ally in Fig. [TJ We can perform Grassmann integration over 
each group without worrying about the contribution from 
the other groups. The integral on (cci) gives f/i, on {X2) 
gives U2 and on (0:3,2:4) gives C/3. The remaining Grass- 
mann integral can be also be performed easily by setting 
the Grassmann variables associated with the three other 



Shailesh Chandrasekharan: Fermion Bag Approach to Fermion Sign Problems 



groups to zero in the exponent. This is equivalent to drop- 
ping some rows and the same columns in the matrix M 
which produces a new matrix W . Thus, the above integral 
becomes 



I^UiU2U3 / [dV#] e 



-ii w ip _ 



UiU2U3J:)et{W) (7) 



Clearly, there is no guarantee that Det(M^) is positive. 
However, if M has the solvable form (|3]) as we assumed, 
then so does W and hence its determinant is also non- 
negative. Each group is referred to as a fermion bag. The 
three bags (xi), {X2) and (xs^Xi) are referred to as inter- 
action site bags since such terms usually arise from inter- 
actions, while the remaining bag is referred to as the free 
fermion bag since the contribution to it comes from the 
free fermion action. 

In the traditional approach, since all Grassmann vari- 
ables are treated uniformly, four-fermion couplings have 
to be converted to fermion bilinears by introducing auxil- 
iary fields. The above discussion suggests that this is not 
always necessary. In fact sign problems can be introduced 
through auxiliary fields and may be avoidable using the 
fermion bag idea. This is illustrated nicely through the 
following simple plaquette model on a cubic lattice. The 
partition function of the model is given by 



Z 



[dtpdtp] e' 



-■4>Mlp + Uj2p = ^^y;,i'zu'4''W i'^lp^ i'yIPy V'jV'j 



(8) 

where tpx , i^x ^^^ ^^*^ Grassmann fields and P represents 
plaquettes on the lattice. We assume that the matrix M 
connects only even and odd sites and has the solvable 
form (JH). For example it can be the free massless stag- 
gered fermion matrix. The interaction is a sum over eight- 
fermion couplings on plaquettes, each term being a prod- 
uct of four ■(/''0's, one from each site of the plaquette. In 
the traditional approach one could use the identity 



gC/-l/'l'/'lV'2'/'2'/'3'/'3V'4'/'4 _ \ g 



^{u^'^ z Eti'A.V'.) 



(9) 



to rewrite the partition function as 



[zp] 



[^p1 



Det(M + C/^/%) 



(10) 



where z is a diagonal matrix with complex elements that 
depend on the Z4 plaquette field [zp]. Since Det(M -|- 

U^'^z) can be complex, the traditional approach suffers 
from a sign problem. The fermion bag approach on the 
other hand, is free of sign problems. 

Instead of introducing auxiliary fields, in the fermion 
bag approach we can expand the interaction term as 



jjn. 



. V'lV'x 



E 

np=OS 



[u n ^.^-) 



(11) 




I I I 



n::n 
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Fig. 2. Fermion bags in the plaquette configuration. Each pla- 
quette can be considered as an interacting fermion bag, while 
the remaining sites as a free fermion bag. At large couplings 
the free fermion bag can split into many smaller bags. 



Introducing a discrete plaquette field np = 0, 1 it is pos- 
sible to write the above partition function as 



Z = 



[«p] 



E / MV'V'] e- 



■tp M V 



n {uu^xi^^rm 



p, xeP 



xeP 



xeP 



An illustration of the plaquette field on a square lattice 
is shown in Fig. [2] We can now group Grassmann vari- 
ables associated with each plaquette where Up = 1 into 
separate bags and the remaining sites which do not touch 
any plaquettes into a separate bag. Performing the inte- 
grals separately the fermion bag partition function takes 
the form 

^ = E ^^^ Det(M^(np)), (13) 

[np] 

where Np is the total number of n^ = 1 plaquettes in the 
configuration [np]. The matrix W{np) is obtained from 
the matrix M by dropping the rows and columns asso- 
ciated with sites that belong to all non-zero plaquettes. 
Since W{np) also has the solvable form dU, Det(VF[n]) > 
and the sign problem is absent. 



3 New Class of Solvable Models 

While the plaquette model discussed above provides a sim- 
ple example to demonstrate the usefulness of the fermion 
bag idea, the full advantage of the method to solve sign 
problems emerges in the presence of dynamical bosonic 
fields. In this section we explain how a new class of lattice 
field theories which contain sign problems when formu- 
lated in the traditional approach, can be written without 
sign problems in the fermion bag approach. The pairing 
mechanism, which seems absent in the traditional method, 
can be resurrected by rewriting some bosonic fields in 
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worldline variables. These new solvable models are de- 
scribed by the following general action: 



s = J2Mm{<j) 



g^{4>))^yAj + S,{a,4,), (14) 



where the index x = 1,2, ..,V labels both space-time and 
other internal indices, ip^, ^^ ^-^^ Grassmann fields, M{a) 
has the solvable form @ and {'l')xy = 4'xSx,y is a diagonal 
matrix that depends on an additional complex scalar field 
(jjx- Importantly, M{a) does not depend on </> and the cou- 
pling g is real and positive. The presence of an arbitrary 
complex diagonal matrix g<P{4>) makes Det{AI (a) + g<P{(j))) 
complex, implying that there is no pairing mechanism in 
the traditional approach and it suffers from a sign problem 
irrespective of the form of the bosonic action. However, 
this sign problem is completely solvable in the fermion 
bag approach if the bosonic action Sb{(y, (f) is constrained 
to have the solvable form as explained below. 

The bosonic action S'f,(cr, cj)) will be defined as solvable 
if the /c-point correlation function 



G[xi,...,Xk,(7) 



'Sb{(7,< 



(15) 



can be expressed as a sum over positive Boltzmann weights 
computable in polynomial time. In the next section we will 
argue that many physically interesting bosonic actions are 
solvable. Writing = p e*^ we will show that for these 
theories we can write 



G{xi,...,Xk,cr) 



E 

[b] 



[dp] f2{b,a,p,n), 



(16) 



where [b] is a worldline configuration of charged particles 
described by integer valued bond variables on the lattice, 
[n] is a configuration of monomers associated to the k sites 
a;i,a:2, ■■■,Xk and J7 > is the Boltzmann weight calcula- 
ble in polynomial time. Such worldline representations for 
bosonic field theories have become popular recently [111 

Assuming that Sb{(y,<j)) is solvable let us now prove 
that the class of lattice field theories with action (fT4|) can 
be written without sign problems in the fermion bag ap- 
proach. We expand the interaction term at each lattice 
site X as 



-a4>^'^:ci>^ 



-5</'a;'0x^^)"" 



(17) 



n^=0,l 



where n^ = 1(0) is a monomer field that indicates the 
presence (or absence) of a monomer. After introducing the 
monomer field [n], the partition function takes the form 

Z = Y, f[d<j][d(j>] e-^''^*'"^ 

[# #] e-^*^(^)^ Y[{-g(^x^x^xr^ (18) 



Assuming that a;i,X2, ..., Xfc are the sites where Ux 
the partition function can be written as 



Z 



j\MY.9'{j 



m 



-Sb{4>,'^) 



(t'x.j 



[d^ d^] e-V'^^('^)'^(-V',^V.J-(-V'.>.j}. (19) 



The term within the first curly braket is the fc-point cor- 
relation function defined in ([T5]). Let us assume that we 
can use (|16p to express it as 



-Si,(0,o-) 



E 

[b] 



[dp] f2{b,a,p,n). 

(20) 

The second curly braket is the fc-point fermion correlation 
function and we use the fermion bag idea to compute it. 
We divide the Grassmann integral into fc -t- 1 groups, one 
for each of the k interaction sites and one group that in- 
cludes all the remaining free sites. The integral over each 
of the k sites is trivial and gives the identity. The remain- 
ing Grassmann integral is simply the determinant of the 
(V — k) X (V — k) matrix W{(j, n) obtained from the matrix 
M{a) by dropping the rows and columns that belong to 
the k monomer sites. If M{a) has the solvable form Q so 
does W{a,n), which means Det(W^(cr, n)) > 0. Hence the 
fc-point fermion correlation function is given by 



[d^d^] e-'^^^W'^ (_^^^^,J...(_V;^^V.J 

Det{W{(T,n)). (21) 

Combining dH]), (gHl), and (EU we obtain 

Z= f[dadp] Y, /■ ^ib,a,p,n) Det(W^(a,n)), (22) 

[n,b] 

which is free of sign problems. 

Instead of using the fermion bag idea we can also use 
Wick's theorem to compute the correlation function in 
(|2T|) . This leads to a sum over Feynman Diagrams which 
can be performed exactly. One obtains 



[j[d^d^] 



~i>M[a]i! 



{-i'x^i^x,)■■■{-i'x^^Xk)} 



Det(M(cr)) Det(G'(cr,n)), (23) 



where G{a, n) is the k x k propagator matrix connect- 
ing the fc monomer sites xi, ...Xk- This diagrammatic ap- 
proach has been exploited in condensed matter physics [S] 
[U]. Combining (PT|) and (E^ . we can derive the duality 
relation 

Bet{W{a)) =Det(M(CT)) I)et{G{a,n)) . (24) 

which connects the diagrammatic approach to the fermion 
bag idea jlOj . From a computational point of view, when 
fc is large the left hand side is easier to calculate and when 
fc is small the right hand side becomes easier. 
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4 Solvable Bosonic Actions 

In the previous section we defined a bosonic action Sb{a, (j)) 
to be solvable if the fc-point correlation function of the 
complex field (j) could be expressed as a sum of positive 
Boltzmann weights computable in polynomial time. In this 
section we argue that many standard actions are solvable 
in this sense. 

Assuming (p = p e^^ let us first argue that the general 
bosonic action of the form 

Sb{cr, 4>) = s{a, p) - ^ Pxy cos{£x6^ + SyOy) (25) 
x,y 

is solvable if s((T, p) is a real function and Pxy > are non- 
negative real numbers for all values of x and y. Ex — ±1 is 
an arbitrary Ising field whose precise form is not relevant 
for the moment. Substituting the above action in P3|) we 
obtain 



where a; is a lattice site on a hyper-cubic lattice and a is 
the unit vector along each direction. The standard choice 
also implies /3, A > 0. Correlation functions of (j) determine 
the solvability of the action. Expressing the complex field 
in polar form (</) = pe )) we obtain 



S{ct^)^Y.^npl + \pi)^ 






Ppxp. 



'x+a 



cos 



TX+: 



a) (31) 



G{xi,...,Xk.,cr) 



[dp] e 



-s((T,p) 



Pxi---Pxk 



X f[de] Y[ e^-'''=°"(^-^-+''''^«) e'^="i ... e'^^'fc (26) 
Using the identitjQ 



ly cos(£^Sx+ei 



'"' = E ^^..(/3-.)e 



ib:^y{e:r0^^eyey) 



b-ci/ — — OO 



(27) 
on each bond (x, y) and identifying bxy as an integer cur- 
rent variable on bonds we obtain 



G{xi,...,xk,<j) = E / t'^^] 

lb] -^ 



c-^^-^^^p,,...p,. 



(xy) X y 

(28) 

where we have defined b as the current (or worldline) con- 
figuration, n as a monomer field such that Ux = 1 aX the k 
sites xi, a;2, .., x/j and n^: = on other sites, and computed 
the integral. Comparing with (J16p we can identify 

/2(6,a,p,n) ^e-^^-'") p., ... p.,, ([] h.yifixy)) 

(xy) 

X (n ^{'^x+^ex{bxy + hyx)y{2^) 

X y 

Since /?(n, a, p, n) is non-negative and computable in poly- 
nomial time, the bosonic action (|25p is indeed solvable. 

Many standard bosonic actions can be expressed in 
the form of ([25]) . As a first example, consider the stan- 
dard complex scalar field theory with quartic coupling on 
a lattice whose action is 

5(0) = E('*l'^-I' + ^I'^-l") ~ f^Y.^'l>-^*x+S. + <t>x+arx) 

X x,a 

(30) 



which is indeed of the form (|25l) . 

As a second example we consider an 0(4) invariant 
scalar field theory involving four real fields denoted as 
ip = (cr, tt) whose lattice action is given by 

Sb{v) = E |^(yx ■ ya:) + >^{^x -ifxYj-Py^^^x- 'Px+a, 

X X^OL 

(32) 
For reasons that will become clear later, we choose to iden- 
tify (j) = — *(7ri — 1^2) as the complex scalar field whose cor- 
relation functions determine the solvability of the model. 
Assuming (f) — pe we can rewrite the action as 

Sb{a,TT3,(t)) == s{a,TT3,p) - ^l3pxPx+aCOs{9a; - 9x+a) 

X,OC 

(33) 
where 

s(a, ^3, p) = ^ {K{al + ttI + pI) -f X{al -f tt^ + plf) 

X 

— P/^ [C'xCx+a + Trz.xTr^.x+a) (34) 

x,a 

Again clearly ([55]) is of the form (OSl) . 

As a final example consider a non-relativistic scalar 
field theory described by two complex scalar fields ipi (r, t) 
and <y92(r, i) where r labels the spatial coordinates and t 
the temporal coordinate. The lattice action of interest is 
given by 

56(</?i,¥'2) = E'*il'^i(^'*)l^ + '''2l'^2(r,i)P) 



r,t 






{t,t'),r 



/3mj {^^ (r, t)^* (r, t') + if* (r, t)^, (r, t')), (35) 



where (r, r') and {tt') denote the nearest neighbor sites 
in space and time respectively. In this model, arbitrary 
correlation functions of both complex fields (^i and ip2 de- 
termine the solvability of the action. We can combine both 
fields into a single complex field (px by defining that the 
index x spans space-time lattice points two times. Assum- 
ing V is the total number of space-time points, we can 
define (px — </'i(r,i) while (px+v — ¥'2(1", i). Then correla- 
tion functions of cpx will determine the solvability of the 
model. Assuming (p^ = PajC*®^ the action takes the solvable 
form (I25l) 



^ Ib{P) is the modified Bessel function 



Sb{(p) = E '^='Px ^ E l^^yP^Py cos(6'2; - Oy). (36) 



x,V 
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if Kx and P^y are defined appropriately in terms of ki, K2, 
fis,ij and Pt^ij- 



staggered Dirac operator and by denoting the complex 
field on even sites as Lpe and tliose on tlie odd sites as (po, 
in an appropriate basis ([37]) can be written as (J14p wliere 



5 New Opportunities 

The new solutions to fermion sign problems discussed in 
Sect. [3] create new opportunities to understand strongly 
correlated fermion systems. In this section we construct 
three classes of models in different areas of physics that 
were intractable earlier due to sign problems, but thanks 
to the new solutions they can now be used to address 
interesting questions. 



5.1 Quantum Criticality with Staggered Fermions 

It is well known that massless fermions can become mas- 
sive due to strong interactions. Quantum critical points 
often separate massive and massless phases. In 3 + 1 di- 
mensions, critical points are rare and gauge fields play an 
important role in creating them |17j . On the other hand in 
2 + 1 dimensions many critical points are known to exist 
even in the absence of gauge fields |TB]. In either case, un- 
derstanding their properties is an exciting field of research 
with applications both in particle physics P^f^Q] and con- 
densed matter physics [21,. Since these critical points do 
not occur in a perturbative regime, Monte Carlo methods 
are essential to compute their properties. 

Properties of critical points in 2 + 1 dimensions can 
be studied either through four-fcrmion models or Yukawa 
models [^1^3, . Effects of long range interactions and dis- 
order are also interesting to explore [24l[25l[26l[27] . Unfor- 
tunately, traditional Monte Carlo methods can only be 
used in a small subset of these models where sign problems 
are solvable [28l|29l[3(71[3T|f32 . 33 , 34 . 35 36 37 . 38 . 39 . 40 . A 
large number of equally interesting models have remained 
unexplored due to sign problems. Fortunately, some of 
these can now be studied using the fermion bag approach. 

Consider iV-fiavors of staggered fermions denoted by 
Grassmann fields Xx ii Xx,i, * = 1, 2, .., iV, interacting with 
a complex scalar field (j)x — p^e*^^ through a Yukawa cou- 
pling in three space-time dimensions. The lattice action is 
given by 



s = SbW 



2_^ Xx, 






9 



x.i 



!^^ X " X -. , 



XxaX^ 



(37) 

where x, y denote the sites on a cubic lattice, {xy) de- 
notes nearest neighbor sites, £x = (—1)^1+^2+2:3 assuming 
a;i,X2,a;3 denote the three coordinates of the lattice site 
X. We assume the coupling g to be real and positive. The 

massless lattice staggered Dirac matrix Dxy is given by 



xy 



\^ 



^1,2,3 



,a I *^a;+d,y Ox — OL^y r l"^j 



with -qx^i = 1,?7:e,2 = (-1)^\?7: 



a:, 3 



(-1) 



3:1+3:2 



denoting 



M + g ^(0) 



9Ve 



-D^ 



D 



gfo 



(39) 



Here D is the sub-matrix of the full matrix D'^'*^ that con- 
nects odd sites with even sites. Since Det(Af + 5 <?) can 
be complex in general, the models described by (I37|) suffer 
from sign problems in the traditional approach. However, 
if the bosonic action is solvable and for example takes the 
form (see ^) 

SW = ^{kpI + Xpt) - Y^ PxyPxPy COs{exOx + EyOy), 

X x,y 

(40) 
these models are free of sign problems in the fermion bag 
approach. 

Models described by (1371) possess a rich phase diagram 
especially when the freedom in the bosonic action is ex- 
ploited. There are massless and massive fermion phases 
and the critical points that separate the two have prop- 
erties that can depend on N and the symmetry that is 
broken. By choosing the bosonic action carefully we can 
change the symmetries of the model. For example if Pxy is 
non-zero only between even and odd sites, then it is easy 
to verify that the action (|37| is invariant under the U{1) 
chiral symmetry 



j{e^-ip) 



(41) 
One the other hand if we allow Pxy to be arbitrary, the 
action (j37p is invariant only under a Z2 symmetry 



ie^7r/2 



i'x. ^ e--^/^V.,», i^x. ^ e"^"'>.,^, e"- ^ e 



^ie^7r/2 



the staggered fermion phases. Using the properties of the 



(42) 

It is also easy to introduce various kinds of disorder and 
long range interactions through the bosonic action and the 
Yukawa couplings. 



5.2 Pairing Interactions with Hamiltonian Fermions 

The idea of pairing plays a central role in our understand- 
ing of superfluidity and superconductivity. It has a vari- 
ety of applications from materials physics |41) to neutron 
stars [^. Many types of pairings have been discovered 
in nature. For example spin-singlet s-wave pairing is the 
simplest and leads to conventional superconductivity. On 
the other hand spin-triplet pairing is also well known and 
leads to unconventional superconductivity with many in- 
teresting properties [13] . The well known high Tc materials 
contain d-wave pairing and continue to be interesting even 
today ;44 . 

From a computational point of view, pairing also plays 
an important role in solutions to sign problems. As we al- 
ready pointed out in Sect.[51 if the fermion matrix has the 
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solvable form ^ then a pairing mechanism (in a compu- 
tational sense) exists in the model. In a dense fermion sys- 
tem such pairing mechanisms can easily be lost especially 
when fermions interact with bosonic fields. While QCD at 
finite baryon densities is a famous example of this feature 
of dense fermion systems, many simpler models exhibit 
similar features. The fermion bag approach is able to un- 
cover hidden pairing mechanisms and hence is able solve 
new sign problems. Below we construct the simplest class 
of such models with hidden s-wave pairing. With some 
effort it should be possible to extend these ideas to pair- 
ings in other channels. In order to demonstrate the wide 
applicability of our ideas, here we choose to work with 
Hamiltonian lattice fermions. 

Consider the free Hamiltonian of a spin-less fermion 
given by 

^ = II'^r' ^r'.rCr (43) 



Although the action (H5|) seems to contain pairing interac- 
tions the complex scalar field hides it. Indeed the fermion 
determinant in a fixed scalar field background can be com- 
plex and the models are intractable in the traditional ap- 
proach due to sign problems. By rewriting the bosonic 
variables in the world line representation one can resurrect 
the lost pairing mechanism in the fermion bag approach. 
This solves the sign problem. 

To see that (HS)) can be written as P^ we perform the 
following transformations 



Xt.ra ^ "'/'t,r,l> Xt^r.l ^ -1pt,r,2, 
Xt,r,2 -^ V't,r,l, Xt,r,2 ^1pt,r,2- 



(49) 



In addition using the fact that D^ = D\ we can verify 
that (Uni) indeed takes the solvable form (|14p where in the 
2x2 space of the fermion species we get 



where Cr and cj are fermion annihilation and creation op- 
erators at the lattice site r and -ffr',r is the one-particle 
operator that describes fermion hopping on the lattice. It 
is well known (see '45' for a recent review) that the cor- 
responding partition function can be written as a Grass- 
mann integral on a space-time lattice, whose action is 



S 



E 

t',r',t,r 



Xt'. 



Dtr 



■;t,r Xt,i 



(44) 



where D is the non-relativistic matrix given by 

Dt',r';t,r = -Sf + l,tSr' ,r + (^t'.t (^r',r - £-f^r',r)- (45) 

Assuming Lt time slices, the temporal continuum limit is 
obtained by taking e — ^ and Lj — > oo while keeping 
eLt = 1/T (the inverse temperature) fixed. Finite den- 
sities can be introduced with a chemical potential that 
enters through the Hamiltonian |45) . 

With this notation consider two species of fermions 
labeled with Grassmann fields XtriJ Xt,r,i, « = 1, 2 which 
interact with two complex scalar fields ipi and ip2- The 
action of is given by 

S = Sb{ipu'P2)+ Yl Xt',r'.i Dt',r':t,v Xt,r,z 
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E ('^2(r, t)xt.r,lXt,r,2 + Vl (l", i)Xt,r,2Xt,r,l) (46) 



i',r',i,r,i 



where Sb{^i, ^2) will be assumed to be solvable and could 
take the form p5D . The action is invariant under SU{2) 
transformations 



where U G SU{2) and U{1) fermion number transforma- 
tion 



Xt,r,^ -^ e'" Xt.r,t, Xt,v,i ^ Xt,r.i 



M + g <P 



9Vi D 



(50) 



(48) 



Since 5b(iy9i, (/?)2) is solvable, the partition function can 
be expressed without sign problems in the fermion bag 
approach. 

Traditionally, the above class of models have been stud- 
ied only in the limit where the bosonic fields behave like 
auxiliary fields. When they are integrated out one ob- 
tains the attractive Hubbard model, which can be for- 
mulated without sign problems. However, when bosons 
remain dynamical traditional methods cannot solve the 
associated sign problems. Fortunately, since the fermion 
bag approach is free of sign problems, one can use these 
models to understand the physics of pairing in more gen- 
erality. By exploiting the freedom in the choice of bosonic 
actions, one should be able to explore many new regimes 
and scales that have remained unexplored. Further, as we 
explain in Sect. |6] below, it should be possible to make 
_ffr',r more interesting by introducing spin-orbit couplings 
and thus allowing us to extend the limits of possibility 
even further. 



5.3 Yukawa Models with Wilson Fermions 

Long ago in his pioneering work |46j Yukawa suggested 
that nuclear physics may be governed by a theory of mas- 
sive nucleons interacting through the exchange of pions. 
Today we understand that this approach to nuclear physics 
can be made more systematic through the use of nuclear 
effective field theory (NEFT) |47ll48j . In fact lattice formu- 
lations of NEFT have become popular recently [?9] and 
some properties of low lying nuclei have also been com- 
puted j50j . From a more conceptual point of view, while 
NEFT without pions is on a strong footing, there is dis- 
agreement on how to incorporate the dynamics of pions 
systematically [5X11521153] . Symmetries should ultimately 
determine how to incorporate them and it is possible that 
a non-perturbative framework is necessary. 
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It is interesting to explore if lattice field theory models 
that contain the same symmetries and low energy degrees 
of freedom as NEFT, can help us understand how to in- 
corporate the dynamics of pions. Lattice Yukawa models 
are ideal starting grounds for such an exploration [51] . 
For example, in the simplest setting one could start with 
two flavors of light Dirac fermions coupled to an 0(4) 
scalar field through an SU{2)l x SU{2)r = 0(4) symmet- 
ric Yukawa coupling. In a phase where the 0(4) symme- 
try breaks spontaneously to 0(3), the low energy physics 
would contain two massive fermions and three massless 
Goldstone bosons which are precisely the degrees of free- 
dom of NEFT. Further, the couplings of the fermions to 
the bosons would be correctly constrained by the relevant 
symmetries [SF . Since it is not necessary to preserve chiral 
symmetry at the lattice scale, Wilson fermions could be 
ideal for this exploration. As we will see below, the sim- 
plest lattice field theory model with the above properties 
suffers from a sign problem in the traditional approach, 
but not in the fermion bag approach. 

The model we consider contains an isospin doublet of 
Grassmann valued four-component spinor fields XixjXi.x 
i = 1,2 on a four-dimensional space-time lattice, inter- 
acting with both an iso-scalar field a and a triplet of iso- 
vector fields tt. The action of the model is given by 

S = Sb{a, tt) -I- ^ -x^,x M^x,jy Xj,y (51) 

where 

^rxjy = D'^x"v ^ij + 9 {<^^t] + ^iBTij ■ TT^) S^y (52) 

is the fermion matrix, £)(™) is the Wilson Dirac operator, 
D^^v ^mS^y - ^{r~jf,)Sx+f,^y + {r + 'y^,)Sx~f,,y, (53) 

T are the three Pauli matrices that act on the isospin 
space, 7^ and 75 are the five gamma matrices which we 
choose to represent by 



70 



1 
0-1 

73 = 



,71 = 

(73 
CT3 



CTl 
0-1 

,75 = 



,72 = 

i 
-iO 



f C72 

\(J2 



(54) 



for later convenience and Sbicr, tt) is the 0(4) linear sigma 
model p2p . For convenience we have suppressed Dirac 
indices. Ignoring D^^' the action is invariant under an 
0(4) = SU{2) X SU{2) symmetry, but including !)('") 
only preserves the vector SU{2) symmetry. 

The above model suffers from a sign problem in the 
traditional approach. To see this let us define T — 7275T2. 
Assuming K is the complex conjugation operator it is easy 
to see that [KT, M] = and {KTf = 1. This proves that 
the determinant of M is real but not necessarily positive 
[55] . For general background fields (cr, tt), Det(M) can 
indeed be negative. On the other hand, the above model 



belongs to the solvable class discussed in Sect. [3] To see 
this, let us write the matrix M in the 2x2 isospin space, 

Dw+ g(T + ig-f^TTs. ig^si^^i - ^2) \ 
M = I , (55) 

igibiT^i + i'^2) Dyj + ga - igj^TTs J 

where we have separated the contribution from the four 
scalar fields. By redefining x = ipj5, X — *'''2'0, — i(7''i — 
11^2) = 4>, and D(cr, TTa) = 75(Du, -I- ga) + ign^, we can 
rewrite the action as (|14l) where. 



M(cr, TTa) + g <P{(P) = 



D{a,Ti^y 



-DHa,n^) gr 



(56) 



Since the bosonic action is solvable, the above model is 
free of sign problems in the fermion bag approach. 



6 More Solvable Models 

The Fermion bag approach is a general idea to perform 
Grassmann integrals. As long as one can define fermion 
bags carefully and identify symmetries (or other argu- 
ments) that ensure the positivity of fermion bag weights, 
sign problems are absent. The solutions to sign problems 
discussed in Sect. [3] considers the simplest class of models. 
As an example of a more complex situation, here we con- 
sider an exotic class of fermion models written in terms of 
two component Grassmann fields on a space-time lattice. 
They contain terms that naturally arise in the presence of 
spin-orbit couplings and can be interesting in condensed 
matter physics [571155] . 

Let ^pr,t,s, V'r t s, s = 1, 2 be Grassmann fields that rep- 
resent the two components of spin-half fermions hopping 
on a space-time lattice. We assume these fermions inter- 
act with two complex scalar fields that have their own 
dynamics. The lattice action is given by 

S = Sbiipi,(p2) + Sf{ij,'ljj) + Sint{lp, 11-^,^1,^2) (57) 

where Sbi'^1,^2) is a solvable bosonic action whose form 
is not important. 



Sfi^P, V') = - '^{lpr.t+l,s - ^r,t, JV-i 

r,i,s 



-'s.s' *-*r,r' 



/3r.r')^r',t (58) 



is the free fermion action where a^.r' is a real symmetric 
matrix of couplings, /3r,r' are three real anti-symmetric 
matrices of couplings, or are Pauli matrices that act on 
the spin space and 



Sint = £5 y] (</'2(l",i)V'r,t,2V'r,t,l + (/^l (l", 0V'r,t,l^r,t,2) 



E 

bfr,t 



(59) 

is the interaction term. For convenience we define the 
fermion matrix M by expressing Sf{ip, ip) = ^Mip. 
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It is impossible to solve the sign problem in these class 
of models using the traditional approach. However, sign 
problems are absent in the fermion bag approach. The 
reason is that the 2/c-point correlation function 



X {^r[,t[^2^r[.t[,l - 4'r'„t'„2A',,t'„l) (60) 



is positive and can be computed as a determinant exactly 
as in dH])- The proof involves the fact that [KT, M] = 
where K is the complex conjugation operator and T = 02- 
Since {KT)^ = -1 it is possible to prove that Det(M) > 
[56] . Interestingly, the same proof is also applicable to any 
matrix obtained from M by dropping rows or columns 
associated with both spin components at various lattice 
sites. Importantly, the sites where the rows are dropped 
need not be the same as the sites where the columns are 
dropped. This property ensures that all 2fc-point correla- 
tion functions ([50]) are positive. Thus, if the bosonic action 
is solvable then the above model is free of sign problems 
in the fermion bag approach. 

In comparison to models in higher dimensions, there 
are many more models of interacting fermions in two space- 
time dimensions that are solvable using fermion bag ideas. 
For example, models like the one that was recently consid- 
ered with an eight-fermion coupling [59, . can be handled 
with ease in the fermion bag approach. In one spatial di- 
mension there are many fermion matrices that have real 
and positive determinants. Indeed since fermions in one 
spatial dimension can be mapped into hardcore bosons, 
fermion sign problems can be completely eliminated in 
the world line representation in many problems |60) . In 
particular sign problems are absent at any density with 
both repulsive and attractive interactions. Quantum im- 
purity problems in higher dimensions can also be mapped 
to one dimensional problems |61) . 



7 Computational Advantages 

In addition to solving sign problems, the fermion bag ap- 
proach offers other computational advantages. For exam- 
ple, one of the bottlenecks in traditional methods is the in- 
ability to explore large system sizes even when interactions 
are weak or strong. This is because one always works with 
matrices that are as large as the system size. On the other 
hand in the fermion bag approach the size of the matrices 
one deals with is related to the size of the fermion bags 
and not on the system size. The bag sizes are usually small 
at both weak and strong couplings, which makes the com- 
putational effort rather mild at these extremes. The real 
difficulties associated with large matrices are pushed into 
the intermediate coupling region. Even there one usually 
deals with bag sizes that are only a fraction of the phys- 
ical volume, allowing one to explore large lattices with 
relative ease. Another advantage of the fermion bag ap- 
proach is that recent developments in worm algorithms 
|63j can be used to sample the configuration space. These 



algorithms can help update topological properties of a con- 
figuration that arc otherwise difficult to update. It was re- 
cently shown that a combination of worm algorithms and 
non-local updates eliminates all critical slowing down once 
the cost of computing determinant ratios is taken into ac- 
count [641 . Worm algorithms also help in measuring ojf 
diagonal observables that get contributions from configu- 
rations that do not contribute to the partition function. 



8 Results from a Four-Fermlon Model 

The fermion bag approach was recently used to study a 
four-fermion model in three dimensions containing one fla- 
vor of lattice staggered fermions. Due to fermion doubling 
the model describes Nf = 2 flavors of four component 
Dirac fermions in the continuum. The action of the model 
is given by 



(xy)GL 

XI ^XxXx){XyXv) (61) 

(xy)fEB 



x,V 



where the staggered Dirac matrix D^'^' was defined in 1 
m is the fermion mass, {xy) labels two types of bonds : 
(1) link bonds L (between nearest neighbor sites) and (2) 
body diagonal bonds B (between opposite sites of a body 
diagonal in cubes). When m — Q^ the model is invari- 
ant under an SU{2) x C/(l) lattice symmetry. The SU{2) 
symmetry arises when x ^nd x on each site are treated 
as an SU{2) doublet [65], while the C/(l) symmetry is the 
usual chiral symmetry of staggered fermions. The mass 
term breaks the U{1) chiral symmetry. 

Since the link coupling C/l looks like a current-current 
coupling, the model with [/« = is referred to as the lat- 
tice Thirring model in the literature. It has been studied 
with traditional methods using the Hybrid Monte Carlo 
(HMC) algorithm [31,32,33 . Due to difficulties in study- 
ing the chiral limit with the HMC algorithm, all pre- 
vious Monte Carlo calculations were performed at non- 
zero fermion masses. The results were then extrapolated 
to the chiral limit and critical exponents were computed 
using the usual scaling analysis. Unfortunately the pres- 
ence of two infrared scales, namely the box size and the 
fermion mass, makes such an analysis difficult. Recently, 
the fermion bag approach was also used to compute the 
same critical exponents. In contrast to the traditional ap- 
proach, exactly massless fermions were studied and lat- 
tices as large as 40^ were explored [TU]. Table [T] com- 
pares the results from various calculations performed so 
far. Although all results are consistent with each other, 
the fermion bag approach provides the most accurate re- 
sults to date. 

Thirring models in the continuum have a long his- 
tory [66 , 67 . They have been analyzed using various tech- 
niques including large Nf [68], Dyson-Schwinger equations 
[M1I701I711I72J . and renormalization group flows [73]. To 
compare lattice results with continuum results requires 
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Table 1. Comparison of all results obtained in the Nf — 2 lattice Thirring model described by (|61|) with Um ~ 0. The results 
from the fermion bag approach are by far the most accurate to date. 



Work 



Range Range 
of L of m, 



U. 



V 



Vi, 



mean field theory [Bl] - 0.25 1 1 

HMC results [32] 8-12 0.4-0.02 0.25(1) 0.80(15) 0.70(15) 

HMC results [33] 16-24 0.06-0.01 0.250(6) 0.80(20) 0.4(2) 

fermion bag Resuhs [ID] 12-40 0.2608(2) 0.85(1) 0.65(1) 0.37(1) 



much care since the fixed point structure in the contin- 
uum is quite complex. A recent continuum calculation 
finds V K, 2A and 77 « 1.5 [73j, suggesting that the con- 
tinuum Thirring universality is different from the univer- 
sality of the lattice Thirring model. Is it possible that the 
lattice Thirring model actually flows to the fixed point of 
the Nf = 2 Gross-Neveu (GN) model with a C/(l) chiral 
symmetry? 

Lattice GN models also have a long history [^1291150] . 
They can be constructed by introducing auxiliary scalar 
fields at centers of cubes. Each scalar field at the center 
of a cube couples to fermion mass terms at the corners 
|22] . In a theory with C/(l) chiral symmetry, integration 
over the auxiliary fields results in the model described by 
(|6ip with non-zero values for both I/l and C/b- Comparing 
with the lattice Thirring model the only difference is that 
C/r 7^ 0. Unfortunately, this non-zero coupling introduces 
sign problems in the traditional approach [I^. Hence it 
would have been impossible to study an Nf = 2 model 
previously. Yet, such a model with a Z2 chiral symmetry 
seems to have been studied by the authors of [28] and it 
was found that v = 1.00(4) and ry = 0.754(8). Unfortu- 
nately, the sign problem was never addressed. These re- 
sults however, are in excellent agreement with other theo- 
retical analysis [71]. It is possible to add conjugate fermion 
fields and eliminate the sign problem in the traditional ap- 
proach, but at the expense of increasing Nf by a factor 
of two. Such a modified model with a U{1) chiral symme- 
try was recently studied on large lattices and it was found 
that V = 1.03(4) and 77 = 0.91(4) [31]- 

The results for critical exponents obtained from lat- 
tice GN models so far, also seem inconsistent with Tab.[TJ 
This discrepancy is some times used as evidence to argue 
that lattice GN models and the lattice Thirring models 
belong to two different universality classes. If true, this 
leads to a disturbing conclusion that C/b ^ could drive 
the transition into a different universality class although 
no lattice symmetries change! Although traditional meth- 
ods suffer from a sign problem when Um ^ 0, the fermion 
bag approach is free of sign problems even with C/b 7^ 
|12] and can be used to compute the critical exponents. 
Not surprisingly one obtains the same critical exponents 
as in Tab. [T] suggesting that the universality class has 
not changed ^75]. Thus, we learn that the lattice Thirring 
model and the lattice GN model described by (|5T]) be- 
long to the same universality class. But which one? And 



is there a theoretical framework to understand the critical 
exponents in Tab. [T]? 

Most results from lattice simulations in GN models are 
usually compared with large N f calculations. However, it 
was pointed out in [73] that the series is poorly conver- 
gent for small values of N f. It was claimed that a better 
approach would be to perform an e = 4 — d expansion. 
Results for A^/ = 2 up to order e^ is given by [73], 

i^ = 0.5 + 0.23e + 0.12e2, n = 0.57e + O.lOe^, 

?7^ = 0.071e - 0.009e2. (62) 

Substituting e = 1 we obtain v w 0.85, r\ w 0.67 in ex- 
cellent agreement with results of table [1] However, 77^ w 
0.062 is completely off. Given the excellent agreement with 
the other two exponents, it is tempting to conclude that 
the lattice model described by (|5T|) belongs to the uni- 
versality class of the continuum N f = 2 chiral U{1) GN 
model close to the critical point. We think that by ignoring 
the sign problem in their calculations the authors of [28] 
introduced conjugate fermions unintentionally and thus 
studied an A^/ = 4 model instead of an Nf = 2 model. 

Further work is clearly needed to verify these claims. 
Given that rj^ is quite different from the theoretical pre- 
diction, it would be important to confirm it through an 
independent computation in a different model. Studies in 
GN models with a Z2 chiral symmetry would be very use- 
ful. Finally, an independent calculation of other critical ex- 
ponents could help in confirming hyper-scaling relations. 
All these are within reach of the fermion bag approach. 



9 Conclusions 

The fermion bag approach offers an alternative method to 
solve fermion sign problems. Instead of integrating over 
all Grassmann variables uniformly, they are grouped into 
bags such that Grassmann integrals can be performed sep- 
arately inside each bag. By cleverly choosing the bags and 
the background bosonic fields many new sign problems 
can be solved. The new approach also teaches us that sign 
problems can be hidden in the choice of the bosonic fields 
and solutions may require the reformulation of the entire 
problem in new variables. 

We have shown that many lattice field theories that 
are of physical interest but could not be studied until now. 
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can be formulated without sign problems in the fermion 
bag approach. This creates new opportunities in the field 
of strongly correlated fermion systems some of which we 
have outlined in this work. 

Three dimensional four-fermion models containing two 
flavors of Dirac fermions with a U{1) chiral symmetry have 
been studied recently using the fermion bag approach. The 
critical exponents at the quantum critical point between 
the massless and massive phases could be computed accu- 
rately and the results {ly = 0.85(1) and i] = 0.65(1)) are in 
excellent agreement with e expansion. On the other hand 
the anomalous dimension of the fermion field 77^ = 0.37(1) 
seems to disagree. 
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